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ABSTRACT 

The theoretical differences between the traditional 
definition of dimensionality and the more recently defined notion of 
essential dimensionality are presented. Monte Carlo simulations are 
used to demonstrate the utility of W. F. Stout's procedure to assess 
the essential unidimensionality of the latent space underlying a set 
of terms. The' traditional definition of dimensionality makes no 
distinction between ma^or and minor dimensions. It is desirable to 
count only dommanc dimensions m psychometric assessment of 
dimensionality of a latent space, stout (1987, 1988) has provided a 
definition known as essential dimensionality and has developed a 
statistical test to assess the essential unidimensionality of a set 
of Items. Monte Carlo simulation studies with two examinee sizes, 
n=750 and n=2,000, were generated with item parameters resembling 
those of real tests. Both simulra.^ons exhibited good performance m 
assessing essential unidimensionality. Since applications of item 
response theory techniques are becoming increasingly popular m most 
educational assessment, procedures to assess unidimensionality 
accurately will greatly facilitate the use of item response theory. 
Four tables provide data from the simulations. (SLD) 
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Abstract 



This paper points out the theoretical differences between the traditional 
dimensionality and essential dimensionality, empirically demonstrating through Monte 
Carlo simulationb the utility of Stout's procedure to assess, statistically, essential 
unidimensionality underlying a set of items. 
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Presently, most item response theory (IRT) based res^^arch assumes one—, two-, or 
three— parameter iopistk or normal ogive models. The two most cri ical assumptions 
underlying these models are "unidimensionality'' and "local independence." In other ^^ords. 
these models can be apphed to estimate an individual's latent ability provided the test is 
measuring only one ability or dimension: and for a given ability level, examinees' responses 
to different items are independent. The intent of this paper is to present the differences 
between the traditional definition of dimensionality and the more recently defined notion of 
essentiaJ dimensionality, and to em.pirically demonstrate, through Monte Carlo 
simulations, the utility of Stout's procedure to assess essential unidimensionality of the 
latent space underlying a set of items. 

Definition of Dimensionality 

Let Uj denote the i-th item, and Uv^- E (U^^Uo-.-Ut^-), the test. Observed values of 
item and test will be denoted by U| and u^: = (u j.Uo---U|^0 respectively. Let U- = 1 denote 
a correct response and = 0 denote an incorrect response to item i for a randomly chosen 
examinee. The latent random vector is denoted by B cind the particular values it takes is 
denoted by ^. Let T-^{£] denote the probability that a randomly chosen examinee with 
ability ^ will get the i— th item correct. Let X = -jUj, i>l[ denote an item pool. 

According to the classicaJ notion of dimensionality (Lord k Novick, 196S) if k traits 
influence n items in a test, tha^ is, each of the k traits influence examinees' performance on 
at least one item in the test, then k is the dimtiisionality of the latent space. These k traits 
represent k psychological dimensions and define a k— dimensional complete latent space. 

The above definition, although mathematically precise, makes no distinction 
between major and minor dimensions. It has been argued that items axe inherentlv 
multiple determined and it is not uncommon to find more than one ability influencing 



relatively items or is unique to individuad items in a test. These insignificant attributes 
should not he included in psychometric assessment of dimensionality (Humphreys, 19S4). 
Alore over, there could be other factors such as test anxiety, motivation, ability to work 
quickly etc., in addition to the main dominant dimension that influence the test 
performance of an examinee i Hambleton k Swaminathan, 1955). Hence it is desirable to 
count only dominant dimensions in psychometric assessment of dimensionality of a latent 
space. 

Exampie : Consider the construction of a paragraph comprehension test of n items. 
Let n = 5r, denote the total number of items in a test where 5 denotes the number of 
paragraphs and r denotes tlie number of questions following each paragraph. Let the first 
paragraph be about British history, the second about the Second World War. the third 
about thf Sahara desert, the fcurth about classical music, and the fifth paragraph about art 
history. 

Clearly in the above example reading is the common ability influencing all the 
items. In addition each paragraph is influenced by one other ability, namely British 
history in the first paragraph, events in the Second Woiid Wax in the second paragraph, 
3Sid suon. Clearly, according to the traditional definition of dimensionality, the number of 
dimensions underlying thr above set of items is 6. wherein it is desirable to assess the 
dimensionality as 1. namely reading, which is th'^ dominant dimensir-n. 

Although it has been implicitly assumed that only dominant dimensiuns be 
considered in assessing the dimensionality of a latent space and several procedures based on 
factor analytic notion have been developed to assess the dimensionality of a latent space, 
until recently, there has been no precise definition as to what is meant by dominant 
dimension. Stout (19S7, 1938), for the first time, has provided such a definition known as 



fssaitial d:m^n'^ionality which flows from the definition of essential independence as 
opposed to traditional assumption of local independence. Stout fl9S7) also has provided a 
statistical test procedure ,to assess essential unidimensionality of a set of items. 
Xandakumar (10S7) has further refined Stout's procedure. Stout (in press) argues that 
traditional assumptions of unidimensionality and local independence can be replaced by 
respective weaker and psychometrically more appropriate assumptions: essential 
unidimensionality and essential independence. Junker (1988! has proved mathematical 
results linking local independence and essential independence, and has also provided 
procedures for consistently estirp.ating ability under essential independence. 

Definition 1 (Stout, 1988). The item pool X is said to be essentiahv independent (El) 
with respect to latent variables B if X satisfies 



y Ti^'/vv |Cov(U..U.j8 = ^)! 
D.Ji) ^1<^<J<^ ! ^Lb 0 as N - cu. 



The distinction between LI and EI is th?t LI requires Cov djT'j j 8 = i) = 0 for all ^ , 
whereas EI requires that the a\erage Cov f ^Y^'j i- ^ small in magnitude (for all £] 
a.syniptotiraliy. Hence Ei is a weaker assumption than LL 



Definition 2 (Stout, 19S8i. The '^^'^'vtinl dimnifiionality {(\^) of an item pool X is the 
minimal dimensiunality necessar\ to sati5^^ the assumption of EI. When d-g = L essential 
dlmensionalitv is said to hold. 
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Method 



Monte Carb singulation studies are used to denmonstrate the effectiveness of Stout's 
procedure (Stout. 19S7: Nandakunmar 19S7) to assess essential unidinmensionality. Two 
types of tests, as described below, are considered \\here they are essentially unidimensional 
}d£ = l) hut require more than one single ability in order to correctly answer the items. In 
both cases a bivaiiate extension of the unidimensiond] three parameter logistic model with 
compensatory abilities (Equation 1) is used to generate item responses with uncorrelated 
abilities. Two examinee sizes: J=750 and J=2000 are simulated in each case. The item 
parameters are generated to resemble those of real tests. The means and the standard 
deviations of simulated items are like those of real tests obtained from different sources in 
(he literatu/e. 




(i^,-b,.)+a...(M-b./)J} 



This cast' is arialoguus to paragraph comprehension test. The n items of a test are 
s})lit into h groups vshere each group of items is influenced h\ two abilities. Out of thest^ 
two abilities, one abilit\ is cjmmon to all items of the test (example, reading), and the 
utlier ability is unique to each grour (example, art history, second world war etc.). In aJ] 
thert' ar.^ j+h abilititis and each itt^ni is influenced b\ two abilities. The degree oi influence 
ol rumnivra ability is considered major and the degree of influence of second ability is 
cuiisidert'd minor. Tht^ item disciiminating paraiiRters are generated such that their means 
and \ariaiK>s are a function of the degree to which tht^y influence the trait; and the ratios 
of means uf a^ and a,,, and tiie ratio oi variances oi a^ and a , are the same. For instance, if 



the contribution of the major ability 15 .6 and the contribution of the niinor ability 15 J. 
then the means and variances of a^ and a are a function of the degree of contribution to 
their respect^-e abilities. The contribution of minor ability is denoted as ConaJ. The 
contribution of major ability is 1— 'Cona2. Abilities are generated from standard normal 
distribution. 
ase J 

This is different from Ca^e 1 in that there are only two abilities one major and the 
other minor and the ba.\.e two abilities influence all items. The item and ability parameters 
are generated as in the case of Case 1. 



Results 



The simulation results shown in Table 1. 2. and 3 exhibit good performance of 
Stcut's procedure in assessing essential unidimensiondlity. Each cell of Table 1 displays 
rejection rate for the Case 1 with two items per paragraph, per 100 repeated trials of 
Stout's proceaure. on simulated tests that resembles conceptually that of a paragraph 
comprehension test. According to the asymptotic theory when ^£=1. one expects on the 
average about oV{ rejections. Each cell of Table 2 displays rejection rate per 100 repeated 
trials for the Case 1 with five items per paragraph. In other words each minor abili'y 
influences 5 items. Coiiten*"^- cf Table 1 and Table 2 indicate that Stout^s procedure is able 
to establish essential unidimensionaJity. For short tests of 25 or 30 items however, r=5 
amounts to one fifth of the test being contaminated. Stout's procedure in such a case 
assesses it as not essentially unidimensional. Table 3 displays rejection t-ates for the Case J, 
where only two abilities, ont major and one minor, influence all irems of the test. The 
results of Table 3 are in accordance with what is desirable. These results can be contrasted 
wirh those of a typical two-<iimensional case as displayed in Table 4 1 Stout. 19S7). where a 
test consists of two 
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major abilities influencinf, all items. The results of Table 4 indicate that Stout's procedure 
maintains very good power when the correlation between abilities is as high as 0,5. 

Educational Implications of the Study 

Since applications o^'' IRT techniques are becoming increasingly popular in most 
educational assessment procedures, uncritical use of IPtT models can have grave 
consequences in estimating and interpretations of educational outcomes, Unidimensionality 
being one of the critical assumptions of IRT modeling, procedures to accurately assess 
uriidimensionality will greatly facilitate the use of IRT in both theoretical as well as 
applied research. 
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Tabic 1 



Rejection Rates per lOU Trials for the Ca^e 1* 


One Maior 


and Several Alinor 


abilities influence test items, dr. = 1. Conal?^ 


= V.2. r = 


■2. a = m 




J 


TESTS 




n=j6 


n=40 


n=oO 


7oU 1 


3 


1 


•jUOCi 0 


•■> 


2 



Notes : 

^Cona2 denotes the contribution of minor ability influencing test items. 

r^^ denotes the number of items influenced by a particular minor ability 
J denotes the number of examinees simulated, 
n denotes the number of items in a test. 
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Rpj^Thon Ratps ppr 100 Trials for liif Case ]• One Al.^joraiu] Spv^ral Miiir-r 
abilities influence test items, d p = 1. Conal.' ^ = 0.2. r ^'= 5. a = .05 



TESTS 



n='J:) n=3U n=40 n=50 



750 


11 


1 


3 


1 


■JiJUU 


• »- 
- ( 


15 


3 


1 



Notes : 

^Conal? denotes the contribution of minor ability influencing test items. 

denotes ihe number of hems influenced by a particular minor ability 
T df^notps the number ol examippps simuiated. 
n denotes \h< number of items in a test. 
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Table 3 

Rj^jertion Hates per lOfl Trials for the Pasf 2: One Ma; and One Alinor 
abilities influence test items, d p = 1, Cona2 '' = 0 2. a =r .05 



J TESTS 



n=50 



roll 


5 


•JOOO 


0 



Notvs: 

^Conal' denotes the contribution of minor ability influencing test items. 
J denotes the number of examii es simulated, 
n denotes the number of items in a test. 
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Rcicclion Rnlcs for d - 2. c = 0.2 Logistic Study 
Number of Rejections per 100 Inais 
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